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A METHOD OF TRANSMITTING OR RETRANSMITTING PACKETS 

IN A COMMUNICATION SYSTEM 

BACKGROUND OF THE INVENTION 
[0001] Voice communication has been the dominant application in 
wireless networks. Cellular standards, such as Global System for 
Mobile Communication (GSM) and IS-95 were originally intended for 
voice traffic only. With the recent growth of the Intemet, voice and 
high-speed data are now offered over wireless networks. One issue 
has been improving the physiccd layer efficiency of packet data 
services. 

[0002] Third generation wide-area wireless networks include the 
Introduction of packet data services over a CDMA network, in 
particular, the introduction of fast rate adaptation and smart 
scheduling over CDMA downlinks. More recently, incremental 
redundancy protocols have been proposed with a view towards gaining 
"time diversity" for short transmissions. One such protocol, which 
works well with the channel chatnges and smart scheduling, is the 
Adaptive Asynchronous Incremental Redundancy (A^IR) protocol. 
Versions of this protocol were recently adopted as pairt of the IxEVDV 
as well as high-speed downlink packet access ("HSDPA") standards. 
[00031 Fig. 1 is a block diagram illustrating a portion of a prior art 
wireless communication system. As shown in Fig. 1, the wireless 
access network typically includes an access terminal (AT) 130, a 
communication transmission device such as a Base Station 
Transmission System (BTS) 120, and a communication system control 
device, such as a Base Station Controller (BSC) 110. The BSC 110 
receives data from a core network and passes the data to the BTS 120 
so that the BTS 120 can transmit the data to the AT 130 over a 
wireless link 125. Data is communicated in the reverse direction as 
well and passed from the AT 130 to the core network. The connection 
between the BTS 120 and BSC 1 10 is typically through a Tl link 1 15. 
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A cluster of BTS units may be connected to a router, which in turn is 
connected to the controller via a T3 link. 

[0004] A key to efficient delivery of the data packets transmitted 
from the BTS 120 to the AT 130, has become how efftcient the 
transmission/re-transmission of data is scheduled and transmitted, 
given the redundant nature of a protocol, such as A^IR. It should be 
imderstood that Fig. 1 is merely exemplary of the networks which may 
host such transmission/retransmission. 

[00051 As an introduction to MCS, the concept of a hybrid 
automatic repeat request (ARQ) protocol is helpful. Considering the 
forward link of a wireless network, e.g. wireless link 125, and given an 
information block and some knowledge of the channel conditions, the 
network may proceed to choose a code rate and modulation, then 
transmit packets of data accordingly. AT 130, upon reception of the 
packet, conducts a cyclic redundancy check (after demodulation and 
decoding) in order to determine the efficiency of the trsinsmission. 
[0006] In second generation, and early versions of third generation 
data systems, if the CRC check did not pass, the packet would be 
deemed in error and it is left to the link layer ARQ, (e.g., the radio link 
protocol (RLP) and network layer ARQ protocols, such as the 
transmission control protocol (TCP)) to recover from the errors. In 
other words, complete re-transmission of physical layer packets was 
the only way to recover from link errors. However, hybrid ARQ 
(HARQ) protocols, which work on the physical layer, allow for the 
reuse of S3niibols/coded bits that were deemed erroneous, via "soft 
combining" of frames, with a view towards better utilization of physical 
layer resources. 

[0007] For reuse, if after one transmission from BTS 120, an error is 
detected, the receiver does not discard the bits but Instead retains 
them with the hope of combining them with future transmissions. 
Each trsinsmission on its own may be erroneous, but the ensemble of 
transmissions may allow the frame to recover from link errors. Two 
conventional classes of hybrid ARQ protocols include chase combining 
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(CC) and incremental redundancy (IR) . The former protocol allows for 
combining of modulation sjnnbols, whereas the latter involves the 
combining after demodulation but prior to decoding. 
[0008] Referring to A^IR, "adaptive" refers to the channel sensitive 
nature of the protocol where the modulation and chetnnel coding 
scheme (MCS) is varied between retransmissions. "Asynchronous" 
refers to the time asynchrony that is allowed between successive 
transmissions (in order to work with the underlying scheduling 
algorithm). This leads to chaQlenges of designing "smart scheduling" 
algorithms as well as **rate (or modulation and coding scheme (MCS)) 
selection." 

[0009] Two constraints imposed by conventional HARQ protocols 
operating with other upper layer protocols on physical layer resource 
allocation are the residual packet error rate (PER) constraint and the 
maximum number of re~transmissions constraint. In order to ensure 
that individusd user "packet call" delays are kept within limits, a 
reasonably low error rate should be presented to upper layers. In the 
interest of controlling average packet call delay, the number of 
physical layer (HARQ) re-transmissions permitted for a single packet 
cannot be unbounded. The notion of residucd PER makes sense only 
with a finite number of HARQ re- transmissions. In practical systems 
the number of such re-transmissions is quite limited (e.g. 4). 

SUIMMARY OF THE HWENTION 
[0010] Exemplary embodiments of the present invention are 
directed to more efficient and accurate techniques of utilizing the 
limited number of re-transmissions by improving MCS selection to 
transmit the required packets with a reduced amount of average 
transmission overage. 

[0011] Exemplary embodiments of the present invention are also 
directed to techniques for efficient optimal-approaching MCS 
selection. Exempl£iry embodiments of the present invention are also 
directed to a technique for rate or MCS selection aind scheduling of 
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users in CDMA systems with Adaptive Modialation and Coding and 
Hybrid ARQ. for example, IxEVDV and other high-speed downlink 
packet access (HSDPA) systems. 

[0012] More specifically, exemplary embodiments of the present 
invention are directed to a method of transmitting or retransmitting a 
packet in a communication system comprising determining a 
modulation and coding scheme (MCS) and corresponding resource 
allocation as a function of at least one of a determined past, a 
determinable current, and a future statistic of at least one of channel 
conditions and resource allocation and transmitting a coded 
modulated version of the packet in seiid communication system 
according to the MCS. 

[0013] Additional features and advantages of exemplary 
embodiments of the present invention will be more fully apparent from 
the following detaiiled description of example embodiments, the 
appended claims and the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0014] Fig. 1 is a block diagram illustrating a portion of a prior art 
wireless conmiunication system, capable of hosting the method of 
exemplary embodiments of the present invention. 

[0015] Fig. 2 is a block diagram illustrating a method of 
transmitting a packet using a modulation and coding scheme in 
accordance with an exemplary embodiment of the present invention. 
[0016] Fig. 3 is a block diagram illustrating a method of 
transmitting a packet using a modulation and coding scheme in 
accordance with another exemplary embodiment of the present 
invention. 

[0017] Figure 4 illustrates an exemplary flow chart of the operation 
of statistical bin packing that involves both offline computed look up 
tables and online computations in real time in accordance with an 
exemplary embodiment of the present invention. 
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[0018] Hie accompanying drawings aire intended to depict example 
embodiments of the invention and should not be interpreted to limit 
the scope thereof; and not to be considered as drawn to scale unless 
explicitly noted. 

DETAILED DESCRIPTION OF THE EXEMPLARY EMBODIMENTS 
[0019] Exemplary embodiments of the present invention provide a 
method of more eflBciently and/or more accurately utilizing a limited 
number of retransmissions by improving MCS selection to transmit 
the required packets more efficiently. To better understand the 
method of the exemplary embodiments of the present invention, some 
mathematical concepts used In deriving the exemplary embodiments 
of the present invention will be e5q)lained. These mathematical 
concepts include a brief discussion on notation, aggregate 
approximations, quality of service (QoS) and delay constraints, the 
"bin packing" problem, examples of statistical bin packing, including 
several alternative algorithms, cost fimctions, overheads, folding rate 
selection, and available Walsh space. Exemplary embodiments of the 
present invention implementing these mathematical concepts will be 
explained with reference to Figures 2-4. 

Notation and Brief Review 
[0020] As discussed above, for a hybrid ARQ protocol and the 
forwEurd link of a wireless network, given an information block and 
some knowledge of the channel conditions, a network can choose a 
code rate and modulation and transmit accordingly. A mobile, upon 
reception of the packet, conducts a cyclic redimdancy check (aifter 
demodulation and decoding) in order to determine the efBcacy of the 
transmission. In second generation, and early versions of third 
generation data systems, if the CRC check does not pass then the 
packet is deemed in error and it is left to link layer ARQ such as RLP 
and network layer ARQ protocols such as TCP to recover from these 
errors. In other words, "complete" re-transmissions of physical layer 
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packets is the only way to recover from wireless link errors. On the 
other hand, hybrid ARQ protocols which work on the physical layer 
allow for the "re-use" of s3niibols/coded bits that were deemed 
erroneous, via "soft combining" of frames, with a view towards better 
utilization of physical layer resources. In other words, if after one 
transmission an error is detected the receiver does not discard the bits 
but instead retains them with the hope of combining them with future 
transmissions - each transmission on its own may be erroneous but 
the ensemble of transmissions may allow the frame to recover from 
any link errors. It is noted that the present invention is also applicable 
to the reverse link as well. For example, in the reverse link, when a 
mobile trsinsmits a packet, the base station upon reception may check 
the CRC and send an ack/nack back to the mobile; if it is a nack, 
then the mobile can transmit the subsequent packet and the base 
station can perform soft combining or other functions. For the 
reverse link, some minor modifications to the equations may apply 
due to reverse link specific properties,for instance the SNR of the pilot 
may be factored in the calculations to determine the "traffic to pilot 
ratio" along with the MCS). 

[0021] Two of the most popular classes of hybrid ARQ protocols 
include chase combining (CC) and incremental redimdancy (IR). The 
former protocol allows for combining of modulation symbols, whereas 
the latter involves the combining after demodulation but prior to 
decoding. Recentiy, in the IxEVDV system standardized by 3GPP2 as 
well as the HSDPA system standardized by 3GPP, IR based protocols 
applied to short frames were adopted in the forward link to exploit 
channel variations with finer time granularity but yet deriving the 
benefits of time diversity that large frames enjoy. For reasons set forth 
above, these protocols may also be used in the reverse link.. 
[0022] If B( ) is the information block or packet size to be 
transmitted and T( ) is the transmission time interval (i ll) or packet 
duration (sometimes referred to as Nimiber of Slots (NOS)). In the case 
of HSDPA, T is held as a system constant (for example, 2 ms), whereas 
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in the case of EVDV, T( ) e {1.2,4} slots where one slot is fixed at 1.25 
ms. The information data rate R() for the transmission is: 



R() = B()/T() (1) 

[00231 If |i is the modulation and coding scheme (MCS) used for the 
transmission, \i denotes essentially the pair (m, r) where m is the 
modulation order (e.g. m = 1 for BPSK, 2 for QPSK, 3 for 8-PSK and 4 
for 16- QAM) and r is the code rate. The coded symbols (numbering 
B/r) are always picked as a subset of a "mother" code in both HSDPA 
(of rate 1/3) and EVDV (of rate 1/5) and this subset is referred to as a 
"sub -packet" or "redundancy version". In the reverse link, the mother 
code rate may be different - for example In EVDV, the mother code 
may be rate V4. In HSDPA, m is allowed only the values 2 or 4 whereas 
in EVDV, m can take the values 2, 3 or 4. In the reverse link m could 
also be 1 for certain rates. Rchip denotes the chip rate (1.2288 Mcps for 
EVDV and 3.84 Mcps for HSDPA) and SF denotes the Spreading 
Factor (SF=16 for HSDPA and SF=32 for EVDV). In the reverse link, 
SFs may also be 2 or 4.W denotes the number of Walsh or 
channelization codes used for the multi-coded transmission. W <= 
Wavaii where Wavaii Is the number of Walsh codes available for the 
packet data channel after allocation for other dedicated channels (e.g. 
voice service). Wavaii<= 15 for HSDPA and Wavaii <= 28 for EVDV. 

[0024] The quantities defined above are related by: 

R = B/T = m.r. W.Rchip/SF (2) 

[0025] In a system like HSDPA or EVDV, Rchip and SF are system 
constants that do not change with time. The above relation implies 
that for each value of the modulation order m, \i is determined by the 
parameters (B, W, T). In EVDV, for fixed values of (B. W. T) the 
corresponding (m, r) pair i.e. |i is pre-decided. In other words, the 
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choice of m is not signaled over the air so that the mobile may 
calculate the code rate r of the transmission as both components of 
the pair (m, r) are both implicitly known to the mobile once (B. W, T) 
are chosen by the network and signaled to the mobile i.e. |i is a 
function of variables (B, W, T). 

[0026] In EVDV, r is restricted to be less than 1 for each 
transmission i.e. the size of any sub-packet in terms of coded bits is 
always greater than B. In HSDPA, the value of m (2 or 4) is signaled 
over the air to the mobile along with (B, W) so that it may calculate r, 
and furthermore it is possible for r > 1. Non self-decodable formats 
with r > 1 may be picked only in the case of IR re-transmissions so 
that the effective code rate of all accumulated sub-packets < 1 . Thus 
in HSDPA, n is a function of variables (B, W, m). 

[0027] If c(i) denotes the predicted channel C/I based on the 
feedback reported by the mobile that is used for scheduling, MCS 
selection for the re-transmission (or sub-packet) index i and c'(i) 
denotes the actual C/I encountered by the mobile during that re- 
transmission. The first transmission (sub-packet) of the packet is 
indexed by i = 1 and subsequent re-transmissions increment the value 
of i. It is assumed in defining c(i) and c'(i) that they are based on some 
constant nominal power fraction (Ec/Ior)nom allocated to the packet 
data channel. It can be assumed that (Ec/Ior)nom = 1- c'(i) can differ 
firom c(i) due to the prediction error in predicting the channel fading 
process and the SIR estimation errors in the mobile reports. The Ec/W 
(power fraction) difference between the nominal value assumed for c(i) 
and the actual available value for c'(i) is separately accounted for. 
Multi-path channel effects and gains due to advanced receivers (other 
than RAKE) such as equalizers and multi-path interference cancellers 
are assimied accounted for in the equivalent AWGN C/I c(i) reported 
by the mobile. 

[0028] P(c*(i), ... c'(M) I c(i), c(i-k) Vk>0) is a probability density 
function that is known to the base station, based on empirical 
observations of scheduling, quality based feedback etc. Given 
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asynchronous scheduling operation. Independence between events 
separated by long scheduling Intervals can be invoked to factorize the 
distribution and simplify as: 

P(c'(i), ... c'(M) I c(i), c(i-k) Vk>0) = P(c'(i)|c(i) c(i-k) Vk>OJ.P(c'(i+l))....P(c'(M)) (3) 

It is noted that the above simplifications and any associated 
assumptions regarding the pdf, are not necessary, and the algorithms 
described herein also work as without the slmpltfications. 
[0029] Note that the scheduler action of favoring "good" channel 
conditions for a user's transmission is Implicitly accounted for in the 
above definition. If a predictor is used to process the time series of 
mobile feedback to arrive at a predicted value for c(i) then the time 
series history term c(i-k) Vk>0 can be dropped since it is included in 
the computation of c(i) and write: 



[0030] Further a "stationarlty" condition can be invoked as an 
additional assumption about the above pdf i.e. 



[0031] f vik (c'(i)) can be the short-term packet error rate function for 
the MCS M.(i) = ]Xk (of packet size B) that encounters an effective 
channel C/I of c'(i). It is possible to closely approximate f^k (c'(i)) by 
the AWGN error rate fimction f^# (c'(i)Ym(^(i).c'(i)) Yr(^(i). B)Yd ) of a 
reference MCS ji* of packet size B*. The modulation (more precisely de- 
mapptng) and the code rate difference related "penalty" factors Vm, Yr 
respectively and Doppler related penalty factor ya are explained in 
more detail below. 



P(c'(i), ... c'(M) I c(i)) = P(c'(i)lc(i)).P(c'(i+l))....P(c'(M)) 



(4) 



P(c'(i+k), ... c'(M) I c(i+k)) = P(c'(i), ... c'(M-k) | c(i)) 



(5) 
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Aggregate Es/Nt method and penalties 

[0032] A reasonable approximation to predict the Link error 
performance of an information packet transmitted over a fading 
channel using several re-transmlsslons (sub-packets) that may have 
different MCS formats has been determined and approved as part of 
the 3GPP2 IxEVDV Simulation Methodology. This approximation, 
referred to as the Aggregate Es/Nt method, entails weighted 
accimiulation of the modulation symbol energies of each (sample of 
every) re-transmission of that packet. After due scaling, the resulting 
equivalent Eb/No (per information bit) is used to consult a reference 
link performance (waterfall) curve generated on an AWGN channel for 
a standard MCS format (e.g. QPSK 1/5 code rate in EVDV. QPSK 1/3 
code rate in HSDPA). The "weights" applied are the penalties for 
puncturing and de-mapping of higher order modulations (as done in 
real world detectors/ decoders) for any other MCS that was used for 
transmitting a portion of that packet. These penalties are 
appropriately pre- computed to ensure accurate error prediction. The 
approximation essentially works on the premise of block fading (quasi- 
static AWGN) of the transmission during the short sampling intervals 
(picked well below the coherence time). However the method corrects 
for deviations from this assumption at higher vehicle speeds by 
Introduction of a "Doppler" penalty that is essentially the deviation of 
the actual short-term curve from AWGN. In summary, this 
approximation method and improvements thereto, enable the use of 
one "reference" link-error performance curve per block size along with 
a table of penalties to account for the deviations of the 
coding/modulation scheme, and the Doppler environment, from the 
reference curve. 

QoS and Delav Constraints of HARQ re-transmissions 

[0033] Two constraints imposed by practical HARQ protocols 
interoperating with other upper layer protocols on physical layer 
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resource adlocatlon are the Residual Packet Error Rate (QoS) 
constraint and Maximum Nvimber of Re-transmissions constraint. 
[0034] A reasonably low error rate is to be presented to upper layers 
in order to ensure that individual user "packet call" delays are kept 
within limits. Criteria such as Rate(l-PER), that maximize long-term 
user throughput while benefiting network throughput, work well at 
the user level only when flow into a user's buffer is infinite or very 
long. However, in the context of practical and common services such 
as web browsing, the average delay of the delivered web page is of 
some importance and must be assured within reasonable bounds. 
These delays can be exacerbated if the residual physical layer PER is 
high, in turn triggering several high latency upper (radio link control 
or RLC or radio link protocol = RLP) layer re-transmissions. In a 
similar vein, in the interest of controlling average packet call delay, 
the number of physical layer (HARQ) re-transmissions permitted of a 
single packet cannot be assumed very large or unbounded. The notion 
of residual PER presented to RLC layer itself makes sense only with a 
finite number of HARQ re- transmissions. In practical systems the 
nimiber of such re- transmissions is quite limited (e.g. 4). 

The Bm Packing Problem 

[0035] For a given number of transmissions attempted M of a 
packet of size B = B(l) with associated MCS and channel conditions, 
the resulting error performance, i.e. packet error rate (PER) should be 
evaluated. The aggregate Es/Nt method approximates this by boiling 
down all variables of the transmissions into a single equivalent Eb/No 
of a standard reference packet format of size B*, code of rate r* 
(examples being a mother code of 1/5 rate in EVDV , 1/3 rate in 
HSDPA, and V4 for the reverse link of EVDV) and modulation (QPSK in 
EVDV as well as HSDPA). The PER performance is then obtained by 
looking up this equivalent Eb/No on an AWGN waterfall curve of the 
standard reference packet format denoted f^#. In the case of turbo- 
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codes that have steep waterfall curves, a step function approximation 
of fji# is used to gauge the success or failure of M transmissions by 
comparing the aggregated Eb/No against a threshold (Eb/No)*^ i.e. PER 
= 0 or 1 depending on whether the aggregated Eb/No after due 
penalties is respectively > or <= (Eb/No)*. 

[0036] For schemes based on IR, the re-transmissions involve 
sending, as much as possible, a new set of coded bits i.e. non- 
identical code symbols, and hence processed at the code symbol level 
in the decoder after de-mapping. Additionally, in any "adaptive" IR 
scheme, the modulation and coding itself can change across re- 
transmissions. Based on the "aggregate Es/Nt" method, the inequality 
that describes the criterion for successful transmission for A^IR is as 
follows: 



M (6) 

{S Ee/Ui)c'(i) T(i) Ym(^(i), EAr(i)c'(i))}Yr(^(l), 1^(2),. . ^i(M),B(l)) Yd /B(l) >= (E^J /R^hip 
i=l 

[0037] Here, Yni(p(i), Ec/Ior(l)c'(i))} represents the de-mapplng penalty 
associated with the i^^ transmission and is a function of the 
modulation and the received SNR, Yr(u(l)» li(2),..., ]a(M),B(l)) is the 
coding penalty and is a function of the effective code rate after M 
transmissions and yd is the Doppler peneilty. 

[0038] In the case of chase combining (CC), the modulation and 
coding rate is picked independently only for the first transmission and 
remains fixed for re-transmissions i.e. \x[) is held constant across i. 
Also the coded bits are identical copies in each transmission and 
hence modulation symbols may be soft combined. The corresponding 
inequality for modulation level CC being decoded successfully is: 



M M /J\ 

{S E^Ui)c'(i)} T(l) Ym(^(l). XEe/Ior(i)c'(i)) Yr(^(l). B(l)) Yd / B(l) >= (Eb/No/ /Rchip 
i=l i=l 

[0039] For CC, the de-mapping penalty ym is applied only once at 
the end after the summation over all re-transmissions as a function of 
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the accumulated X (c'(i))- The coding penalty yr is computed only once 
for and the same value is applied after any number of re- 

transmissions and does not decrease as in the case of IR. Thus, while 
CC may have a better de-map penalty due to single de-mapping at 
progressively increasing SNRs, it is worse in terms of coding penalty, 
[0040] The Ec/Ior(i) takes into account the control channel overhead 
Ec/Ior^{i) that has to be set aside to ensure that the accompanying 
packet data control channel is decoded with high success probability 
(and low false alarm and high detection probabilities) and is further 
assumed to be proportioned to the number of Walsh codes W(i) 
allocated for transmission i, i.e.. 

Bs/hrin = Mil (8) 

(Ec/Ior)avail ~ (Ec/Ior) (i) Wavail 

[0041] A reasonable assumption is that (Ec/Ior)avaii (on the average) 
and Wavail are constants over the duration of a successful packet 
transmission. Hence Ec/lor(i) is automatically quantized according to 
the granularity of W(i). Also B(l) is quantized in real world systems 
(maxunimi of 4 levels in EVDV and 64 levels in HSDPA) and so are the 
modulation orders m, time intervals T(i) and multi-codes allocated 
W(l). The MCS |i(i) thereby ends up being quantized as well. If the c*(i)s 
are assimied quantized in certain steps (e,g. 1 dB) over a certain 
dynamic range (e.g. 16 or 32 dB) then every quantity in the inequality 
(6) is either a quantized variable or consteint. In particular it has the 
form: 

(9) 

2 p(i) >= P 

[0042] Q{i) is shorthand for allocated resources such as time and 
multi-code bandwidth i.e. (T(i), W(i)) as well as the chosen packet size 
B(l). p(i) is shorthand for what is essentially the chginnel condition c'(i) 
but is also a function of the allocated resource Q{i) as in the case of 
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de-mapping penalty Vm. P corresponds to the target (Eb/No)* post 
scaling. 

[0043] The problem of figuring out the error rate given M 
transmissions and corresponding resource allocations and channel 
conditions can be inverted to one of finding "good" resource 
allocations given a maximum number of re-transmissions Mmax and 
given the statistics of the channel fades. In other words, (9) can be 
rewritten as 



Mroax 

2 i2(i) p(i) >= P ^^^^ 

and seek resource allocation solution O = [^2(1), Q(2), ... O(Mmax)] such 
that some cost function C{£1 ] of the assigned resources is reduced or 
minimized subject to the accompanying constraint of residual PER < 8 
being satisfied i.e., 

min C(Q) s/t 

Mn«x (11) 

PCSi., £2(i) p(i) <= P) < e 

[0044] It c£in be assumed that the cost function of the vector 
resource allocation is a simple simimation of the individual resource 
cost C components, i.e. 



M_ (12) 

c(n) = am)) 

[0045] Appropriate choices for the cost fimction are described in 
more detail below. 
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[0046] If the p(i)s are deterministic, then (10) Is a bin-packing 
problem where Q(i) are the objects of discrete sizes corresponding to 
allocated resources such as time and multi-code bandwidth as well as 
the chosen packet size. p(i) are size altering variables and essentially 
represent the channel conditions c'(i) but are also a function of the 
allocated objects Q{i) as in the case of de-mapping penalty, p is the 
target bin size to be filled by the objects and corresponds to the target 
(Eb/No)^ . However, since the channel conditions are random and the 
p(i)s are size-altering random variables, the formulation of (11) 
becomes a Statistical Bin Packing (SBP) problem. 

[0047] Q(i) £ { 0, Oa. Qh, --- Oavaii} = W. W is a finite ordered set or 
alphabet of resource adlocations that includes the case of zero 
resource allocation. is the "atomic" or smallest resource allocation 
corresponding to the smallest transmission time allocation possible 
and the smallest, i.e. single, Walsh code allocation of standard 
spreading factor. Qavaii is the largest resource allocation corresponding 
to the largest transmission time allocation possible and the largest, 
i.e. all, Walsh codes allocated firom the pool made available for the 
scheduled packet data channel. In other words, for any element with 
index k in the set W, Qa <= Ok <= Ok+i <= ^^avaii. The ordering on the set 
W is based on the product of the number of codes and transmission 
time length. 

[0048] It may be observed that any practical solution to (11) involves 
an "online" or real-time choice of the sequence Q = [Q(l), Q{2), ... 
£2(mmax)] that attempts reduction or minimization of some expected 
normalized cost function such that the bin is flUed with high 
probability (1-e). "Online" means that the whole sequence Q is not 
chosen all at once during the first decision stage and left fixed 
thereafter. Instead, only Q(l) is chosen based on the prediction of p(l) 
and the statistics P(p(l), p(2), .. p(Minax) | predicted p(l)) and then 
choose 0(2) during the second decision stage based on the past choice 
of Q(l) and the channel outcome p(l) and the statistics P(p(2), p(3), .. 
p(Mmax) 1 predicted p(2)) and so on for Q(3) through 0(mmax] i.e., an 
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adaptive "self-correcting" multi-stage policy is used for resource 
allocation/ MCS selection. At every stage indexed by I. attention is 
restricted to deterministic algorithms Ap that yield a deterministic 
output Ci{i) based on all inputs, deterministic and statistical. A good 
choice of by any good algorithm at the time of the first 

transmission would involve making an a priori estimate of what Q(2), 
Q(3), ... 0(M)max) (i.e. the algorithm's own output) would be in the 
future. For example, the choice of Q(2] during the second stage will be 
influenced by the choice of Q(l) and the actual realization of p(l). But 
since p(l) is not exactly known (except statistically) ahead of time 
when is being decided, 0(2) becomes a random variable during 
the first stage. Similarly during the first decision stage, 0(3), ... Q(Mmax) 
also are random variables that represent the outputs of the same 
algorithm during future stages. Likewise at any transmission instant i, 
the variables 0(1), ... Qii) are deterministic whereas Q(i+1), ... Q(Mmax) 
are random variables depending on p(i), p(i+l) etc. Thus using a 
deterministic algorithm or policy Ap, the following choices are made: 

QAp{1) = Ap(predicted p(l)) is deterministic 

QAp{2) = Ap(a^P(l), p(l), predicted p(2)) is deterministic at stage 2 but 
random at stage 1 . 

£2Ap(k) = Ap(i2Ap(i) S2^P(k-lX p(l), ...p(k-l), predicted p(k)) is deterministic at 

stage k but random at stages k-1 and earlier 

Si\Mm^) = Ap(Q^P(lX S2^P(M^ax-lX P(l)> ...p(Mnuix-l), predicted pQA^)) is 
deterministic at stage Mmax but random variable at all earlier stages. 

[0049] The subscript P is to indicate that the channel statistics 
(conditional pdf) is a parameter in deciding the Q^P(i). Hereinafter this 
subscript is dropped for notational ease. 

[00501 The problem then becomes finding the best policy A =* for 
QA(k), k= 1, ... M max i.e. * achieves: 
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A (13) 



.A 



s/t P(Zi=i a\i) p(i) <= P) <= e . 

[0051] Note that the cost function introduced in (1 1) and (12) is now 
expressed as an expectation due to the rgindomness of the algorithm's 
future output. 

[0052] For any fixed policy A and a given prediction of p(l) and 
P(p(l), p(2), ... p(Minax) I predicted p(l)), Q,Hl) is deterministic. So the 
criterion for optimality for k= 1 can be expressed as: 



min A { a^\l)) + E[2i=2 aSi\i))\ } 

^ (14) 

s/t P(2i=, p(i) <= P)<=e 



[0053] In general, for 1 < k <= Mmax the criterion can be expressed 
as: 

Mnnax 

min A { aSi\k)) + E[Si=k*, Q^k-l). p(l), . . .p(k-l)] } 

S/t P(2i.. Q^i) p(i) <= 31 Q^l),...,Q\k-lXp(l), ...p(k-l))<=e 

[0054] Note that in the solution to (15) the number M of re- 
transmissions actually realized beyond which the 0(i)s are zero, is also 
a random variable of course bounded above by Mmax. 
[0055] Fig. 2 is a block diagram illustrating a method 200 of 
transmitting a packet using a modulation and coding scheme in 
accordance with an exemplary embodiment of the present invention, 
as Implemented in the communication system of Fig. 1. In step S205. 
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a network system of BTS 120 from Fig. 1, or its BSC 110 will 
determine the optimal- approaching modulation and coding scheme 
(MCS) to reduce the Packet Error Rate (PER). It will be understood by 
one skilled in the art that the near-optimal-approaching method may 
also be used as the MCS. Likewise, these exemplary methods are not 
limiting. Subsequent to finding the proper MCS 110, the BTS 120 
may trgmsmit the packet of data to the AT 130 in accordance with the 
chosen MCS in step S215. This embodiment is merely illustrative of 
one iteration in a system involving multiple transmissions (e.g., A^IR), 
[0056] Note, in (15) the number M of re- transmissions actually 
realized beyond which the i2(i)s are zero, is also a random variable of 
course bounded above by Mmax. 

[0057] Fig. 3 is a block diagram illustrating a method 300 of 
transmitting a packet using a modulation and coding scheme in 
accordance with another exemplary embodiment of the present 
invention, as implemented in the communication system of Fig. 1. 
Note, like steps firom Fig. 2 are numbered identically In Fig. 3. Thus, 
in step S205 a transmitting apparatus, e.g., BTS 120 from the 
network system of Fig. 1 or its corresponding BSC 110, wUl determine 
the MCS to reduce PER. At step S215, the transmitting apparatus will 
transmit the packet of data to the receiving apparatus, e.g., the BTS 
120 will transmit the packet of data to the AT 130, in accordance with 
the chosen MCS. As noted above, exemplary embodiments of the 
present invention may also be applied to the reverse link. 
[0058] After the data packet is transmitted by BTS 120, there will be 
quality checks to determine whether the packet was received error free 
by AT 130 at step S305. As an example, in the HARQ environment, a 
HARQ Negative Acknowledgement (NACK) indicates there was an 
error, while a HARQ ACK indicates the packet was received without 
error. 

[0059] If the packet was received without error by AT 130, the 
system will determine whether any other data packets remain to be 
transmitted from BTS 120, at step S3 15. If no other data packets 
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remain to be transmitted, the transmission of data packets ends at 
step S335. However, if the system determines that the BTS 120 has 
other data packets to transmit, the BTS 120 will transmit the next 
data to the AT 130 according to the determined MCS. In an 
embodiment of the present Invention, there may be a new 
determination of the MCS to transmit the remaining data packet(s). 
10060] However, if the system determines at step S305 that the data 
was received with an error (NACK), there will be a determination of 
whether the maximum transmissions (Mmax) of the data packet has 
tramspired. If the maximum nxmaber of re-transmissions by the BTS 
120 has transpired, there will be a determination of whether any more 
data packets are waiting to be transmitted at step S3 15 and the 
method is executed as previously discussed. 

[0061] If the Mmax has not been reached at step 8325, the 
retransmission protocol of the system (e.g., A^IR) wUl re-transmit at 
least a portion of the data packet according to the system MCS at step 
S215- The system operates using the method thereafter, as described 
previously. It should be obvious that the communication system 
described herein is exemplary. It should be noted that one or both of 
the "Yes" output from step 8315 and the "No" output from the step 
S325 could be input to step 8205 for a redetermination of the MCS. It 
should also be noted that one of the numerous embodiments of the 
MCS methods described herein may be chosen at step 8215 in the 
present Invention. 

Examples of Statistical Bin Packing 
[0062] As an example, the method of using the optimal-approaching 
embodiment may be illustrated using two transmissions (maximimi) 
in (16) - (21). 

Example Algorithm 1 - An Optimal Algorithm 
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Q\l)= arg min i {a^O + 2 p(i) CX^\2 \ p(l), i2i))P(p(l))} (16) 

s/t Xpd) 2:p(2) P(S2i p(l)+ Q*(2 I p(l), Sli) p(2) <= |3 I p(l), p(2))P(p(l), p(2)) (17) 
<= e 

where £2*(2 | p(l), QO = arg min j 0(S2j) (18) 

s/t 2 p(2) P(Qi p(l)+ ^ p(2) <= |3 I p(l), p(2))P(p(2)) <= e (19) 
For the second transmission, 

i2*(2 I p(l), = arg min j a^) (20) 

s/t 2 p(2) P( Q*(l) p(l)+ p(2) <= p I p(l), p(2))P(p(2)) <- e (21) 



[0063] Note that P(p(2)) in (21) could be an updated pdf conditioned 
now on the latest prediction of p(2), aided by latest mobile channel 
report, as opposed to the P(p(2)) in (19) which is conditioned on the 
prediction of p(l). 

[0064] In an embodiment of the invention, the residual probability 
of error (PER) constraints ( (17), (19) or (21)) of bin packing may not 
have a solution. In such an embodiment, the largest amount of 
resource possible is allocated so that the amoimt by which the 
constraint is violated is reduced or minimized. In situations when 
cost function tie, the ties are broken in favor of the resource allocation 
that achieves lower probability of residual error, (i.e. excess energy 
per bit delivered albeit for the same cost). 

[0065] In an embodiment of the invention, the nature of the optimal 
approaching method is to allocate more resources at a decision stage 
(or transmission) that had a better charmel condition compared to 
others, given the same uncertainty or variance in the channel 
condition at each stage. Furthermore, if the chaimel conditions at two 
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different stages were comparable, then more resource would be 
allocated to that stage which had less uncertsiinty or lower variance 
about the ensuing channel condition. This is because less variance 
implies less margin of resources to guarantee a certain error rate. 
Hence, in an embodiment for a low Doppler user scheduled the first 
time with a reasonably good channel condition, most if not all of the 
resource allocation is aimed at the first transmission, whose channel 
condition is well predicted (as opposed to subsequent faraway re- 
transmissions whose channel report at the time of scheduling and 
channel condition during actucil transmission was less predictable). 
Conversely, in an embodiment for a high Doppler user where channel 
reports and predictions were essentially useless, the algorithm 
attempts to bcilance allocation of resources across all available 
transmissions, in order to gain increased or maximum time diversity 
and reduce or minimize total allocated resource in order to satisfy, for 
example, equations (17) eind (19) above. 

[0066] In an embodiment of the invention, an optimal-approaching 
choice of resource allocation, includes deterrmning at least one of the 
optimal-approaching resource allocations of (a) a first transmission of 
the packet, (b) the i^ transmissions of the packet (for k <= i < Mmax), 
and (c) the last transmission for k=Mmax. In another embodiment of 
the invention, the optimal- approaching choice of resource allocation 
for a first transmission is obtained by computing a minimum of a set 
of cost values subject to a constraint which is a function of a resulting 
frame error rate. 

[0067] In another embodiment of the invention, the set of cost 
values is obtained by teiking a sum of an actual cost associated with 
each choice of resource allocation for the first transmission and an 
estimated cost of any choice of resource allocation for transmissions 
two through Mmax which satisfy the constraint, including making a 
choice of resource allocations for the second through last 
transmissions, for every choice of resource allocation for the first 
transmission, compute the estimated costs associated with the choice 
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made for the second transmission through the Mmax transmission, 
computing the sum of the cost of all the estimated costs and the 
actual cost associated with the choice made for the first transmission 
to produce a set of costs, selecting values in the set of costs for which 
the frame error rate constraints are met, and selecting the minimimi 
value of the values selected as an optimal value. 

[0068] In another embodiment of the invention, the optimal- 
approaching choice of resoiorce allocation for the i^ transmission (for 
k <= i < Mmax) decided at any kth transmission stage (1 <= k <= Mmax) 
is obtained by computing the minimum of a set of cost values subject 
to a constraint fimction expressing the resulting residual error 
probability. In another embodiment of the invention, the set of cost 
values is obtained by taking the sum of fixed costs associated with 
transmissions 1 through (i-1) based on deterministic resource 
allocation and channel conditions, a cost of every choice of resource 
allocation for the i^*^ transmission, and an estimated cost of any choice 
of resource allocation for transmissions (i+1) through Mmax subject to 
satisfying the residual probability of error constraint function. 
[0069] In smother embodiment of the invention, the optimsil- 
approaching choice of resource allocation for the last treinsmission 
decided at any kth transmission stage (l<=k<=Mmax) is obtained by 
computing the rnlnlmum of a set of cost values subject to a constraint 
function expressing a resulting residual error probability. In another 
embodiment of the invention, the set of cost vedues is obtained by 
taking the sum of the actual costs associated with all previous 
transmissions and expected costs of every choice of resource 
allocation for the last transmission subject to satisfying the residual 
error probability constreiint. 

[0070] In another embodiment of the invention, the optimal- 
approaching choice of resource allocation includes iteratively 
obtaining optimal values of the resource allocation at every stage 
beginning with the Mmax transmission and worlsing baclswards to the 
kth transmission of interest. In another embodiment of the invention. 
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the optimal values of any stage i may be estimates or expectations, 
where at each step of the iterative process, resource allocations that 
do not satisfy the residual probability error constraint fimction are 
excluded. In another embodiment of the invention, the expectations 
are conditioned on deterministic resources and channel conditions 
measured, computed or assumed from stages previous to the i^ stage. 

Example Algorithm 2 - A Near-Optimal Algorithm 

[00711 Below are some examples of two transmissions maximum 
embodiments of the invention using a near-optimal-approaching 



method. 

Q*(l) = arg min i { CX^i) + (X^\2 | QO)} (22) 

s/t Z p(i) S p(2) P(S2iP(l)+ I p(2) <= P I p(l), p(2))P(p(l), p(2)) <= e (23) 

where Q.\2 \ Qi) = arg min j a[S2j) (24) 

s/t 2 p(i) 2 p(2) P(aip(l)+ S2jp(2) <= P I p(l), p(2))P(p(l), p(2)) <= 8 (25) 
For the second transmission, 

Q\2 I p(l), = arg min j a^) (26) 

s/t Z p(2) P(Q*(1) p(l)+ p(2) <= p I p(l), p(2))P(p(2)) <= e (27) 



[0072] This a priori estimate of the allocation Q#(2|ai) used in the 
computation of Q*(l) in (22) is an alternative and it is computed as an 
average over p(l) as well in (25). whereas, P(p(l)) of (25) may be a delta 
function a posteriori i.e. after p(l) event has been realized. Since the 
variation In aposteriori p(l) is assumed to exist, albeit perhaps less 
than optimal accurately with respect to the optimum-approaching 
embodiment, the overall resource allocation Q^{1] + i2*(2|Q^(l). p(l), 
p(2)) may be over-allocated. Furthermore, for the same reason of the 
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variation in a posteriori p(l) assumed existent, the resource allocation 
strategy of (22) will shift more resource towards relative to 

£2*(2|i2^(l), p(l), p(2)) compfired with the optimal-approaching resource 
balancing that exists between and Q*(2 1 p(l)). 
[0073] Due to combination of the above reasons, may be over- 

allocated compared to the optimal S2*(l) on the average. If the packet 
fails In the first transmission with Q*(l), then in the second 
transmission, the algorithm # will automatically allocate resources 
i2*(2) to correct for the over-allocation the first time. Also this time 
around it will do so by realistically assuming the perfect realization of 
p(l) as in (27) as opposed to its statistical knowledge as in (25). On 
the other hand, the over-allocation of resource in the first 
transmission with i2*(l) may cause it to succeed with more energy 
delivered per bit in the packet transmission than is necessary to 
maintain the residual PER, thus implying resource waste. One skilled 
in the art will appreciate that for a significant number of cases Q^(l) > 
S2*(l) + Q*(2) and on the average, it wUl be true that Q*(l) + i2*(2) >= 
S2*(l) + Q*(2). 

[0074] In an embodiment of the invention, a near-optimal- 
approaching choice of resource allocation, includes, at any said 
transmission of the packet, determining a candidate vector value of 
resource allocations of current and fijture transmission of the packet 
up to Mmax. In another embodiment of the invention, the resource 
allocation at any given stage is obtained by fixing any candidate vector 
value of resource allocations of the current and future stages up to 
Mmax, if a total probability of residual error for the c£indldate resource 
allocation is larger than a target, the candidate resource allocation is 
rejected as infeasible, among all feasible candidate resource 
allocations, selecting the one with a minimum total cost, in case of 
ties, breaking ties in favor of the resource allocation vector that has 
the lowest residual error probability, and setting a first element of the 
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resource allocation vector as the desired resource allocation for the 
current stage. 

[0075] In another embodiment of the invention, fixing any candidate 
vector value of resource allocations of the current and future stages 
up to Mmax includes fixing a vector value of the future channel 
conditions with an associated channel condition vector probability of 
occurrence obtained fi'om a Joint probability density function that 
models the channel during times of the impending current and future 
transmissions of the packet, finding the conditional residual error 
probability for each candidate resource allocation conditioned on the 
fixed future channel conditions by looking up a reference error curve, 
multiplying the conditional error probability with the channel 
condition vector probability, varying the channel condition vector and 
its associated probability and repeating said fixing, finding, and 
multiplying steps and accumulate probabilities for each iteration to 
yield a total probability of residual error, and summing the costs of 
the candidate resource allocation components to obtain a total cost of 
the candidate resource allocation. 

[0076] In another embodiment of the invention, the reference error 
curve is expressed as 0 if a desired signal-to-noise, due to the 
combination of fixed resource adlocation and past, present and future 
channel conditions, is exceeded and 1 , If not. 

Example Algorithm 3 - A Near-Optimal Algorithm (approximate 
Maximum Expected Throughput (MET)) 
[0077] An alternate embodiment may be expressed as follows. 

S211) = arg min i { a^ii) + 2 p(i) Z p(2) (XQ^(2 \ p(l), p(2))) P(p(l), p(2)) } (28) 

s/t Zp(i) Sp(2) P(^2i p(l)+ i2~(2 I a, p(l), p(2)) p(2) <= p I p(l), p(2))P(p(l), (29) 
p(2)) <= e 
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wKere Qr(2 \ Qi, p(l), p(2)) = arg min j Ciflj) 



(30) 



s/t P(Qi p(l)+ QjP(2) <= p I p(l), p(2)) <= E 



(31) 



For the second transmission, 



Q"{2 I p(l), £2"(1)) = arg min j 



(32) 



s/t 2p(2) P(Q~(l)p(l)+ p(2) <= P I p(l), p(2))P(p(2)) <= 8 



(33) 



[0078] This a priori estimate of the allocation Qr{2\Q,u p(l). p(2)) of 
(30) used in the computation of QTil) in (28) is an alternative, as it is 
computed assuming knowledge of p(2) as in (30), (31), whereas in 
reality, P(p(2)) is not a delta function. Since the variation in p(2) is 
assumed to be non-existent, albeit not necessarily tme, the overall 
resource allocation Q~(l) + Q12|Q~(1), p(l), p(2)) may be under- 
aUocated. Furthermore, for the same reason of the v£ulation in p(2) 
assumed non-existent, the resource allocation strategy of (28) will 
shift more resource towards S2"^(2 1£^~(1), p{l), p(2)) relative to Q~(l) 
compared with the optimal resource balancing that exists between 
a*(l) andQ*(2|Q*(l). p(l)). 

10079] Due to combination of the above reasons, i2~(l) may also be 
much under- allocated compared to the optimal ii*(l) on average. If the 
packet fails In the first transmission with QTil) then In the second 
transmission, the MKT method will automatically allocate resources 
Q~(2) to correct for the under-allocation the first time. The MET 
method will do so by realistically assuming the actual variations in 
p(2) as in (32), (33) as opposed to its knowledge explored in (30), (31). 
But the fact that it has to allocate a significant amoimt of resource in 
the last and only remaining transmission (i.e. without time diversity) 
may cause it to add margin and waste capacity or violate the residual 
PER constraint. 
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[00801 In an embodiment of the invention, a near-optimal- 
approaching choice of resource sdlocation, Includes, at any said 
transmission of the packet, determining a cemdidate resource 
allocation of the current transmission of the packet and candidate 
vector value of resource allocations of the future stages up to Mmax. In 
another embodiment of the invention, the resource allocation at any 
given stage is obtained by fixing a candidate resource allocation for 
the current stage, fixing ciny candidate vector vgdues of resource 
allocations of future stages up to Mmax, setting the total cost of 
resource allocation at the current stage as a function of the choice of 
current resource allocation to the expected minimum cost of the 
future allocation plus the cost of the choice of current resource 
allocation, among all feasible current resource allocations, selecting 
the one with the minimum total cost of resoiirce allocation, and in 
case of ties, breaking the tie in favor of the current resource allocation 
vector that has the lowest residual error probability, where the current 
resource allocation that results in a minimum total cost is the desired 
resource allocation for the current stage. 

[0081] In another embodiment of the invention, the step of fixing 
any candidate vector values of resource allocations of future stages up 
to Mmax, includes fixing a vector value of the future channel conditions 
with an associated channel condition vector probability of occurrence 
obtained firom a joint probability density function that models the 
channel during times of the impending current and future 
transmissions of the packet, finding the conditional residual error 
probability for the choice of current resource allocation and candidate 
future resource allocation conditioned on the fixed channel conditions 
by looking up a reference error curve, checking if the conditional error 
probability is greater than the target residual error probability £ind if 
yes, then the candidate future resource allocation is rejected as 
infeasible, if feasible, summing the costs of the candidate resource 
allocation components to obtain a total cost of the feasible candidate 
future resource edlocation, selecting among all feasible candidate 
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future resource allocations for that channel condition vector, the one 
with the least cost, wherein if no candidates cire feasible, selecting the 
largest resource allocation, multiplying the conditional error 
probability of the least cost future resource allocation with the 
associated channel condition probability, varying the channel 
condition vector and the associated probability and repeating the 
fixing, finding and checking steps and averaging the least cost 
resource allocation by weighing with the channel condition vector 
probability of each iteration and summing to yield the average cost of 
future resource cdlocation given the choice of resource allocation for 
the current stage, accumulating the product of the conditional error 
probability and channel condition probability over all channel 
conditions, and checking if the resultant is less than a target residual 
error probability, and if so, the choice of current resource allocation is 
feasible. 

[0082] In another embodiment of the invention, the reference error 
curve is expressed as 0 if a desired signal-to-noise, due to the 
combination of fixed resource allocation and past, present and future 
channel conditions, is exceeded and 1 , if not. 

Example Algorithm 4 — A Near-Optimal Algorithm (Maximum Likelihood MCS) 
[0083] Another alternate embodiment may be expressed as follows. 



Q^Xl) = argmaxhL(S2h) 



(34) 



where L(Qh) = 2 2: P(p(l), p(2)) 



(35) 



{(p(l), p(2)): ftni|p(l). P(2)) = iih} 



is computed for each i2h e J (resource space), where: 



Qni|p(l), P(2))= arg min i {CXfld + Ci^^"\2 \ i2i,p(l), p(2))) } 



(36) 
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s/t P(i2i p(l)+ QrX2 I Qi, p(l), p(2)) p(2) <= P I p(l), p(2)) <= e (37) 

where ^""^(2 | Qi, p(l), p(2)) = arg min j 0[^) (38) 

s/t P(Qi p(l)+ Qjp(2) <= P I p(l), p(2)) <= e (39) 
For the second transmission, 

^^^(2 I p(l), Qr\l)) = arg min j (40) 

s/t 2 p(2) P(Q"^^(1) p(l)+ Qj p(2) <= p I p(l), p(2))P(p(2)) <= e (41) 



[0084] Hie ML algorithm tries to target the MCS for each 
transmission that is "most likely" to consume the least amount of 
resources over any other MCS rather than the MCS that reduces or 
minimizes average resource consumption like the earlier algorithms 
£ind its variations. The algorithms #, - (approx METT) and ML are easy 
to implement in terms of on-line computation complexity and their 
implementation. 

[0085] As an example, the method of using the optimal-approaching 
embodiment may also be illustrated using three transmissions 
(maximum) as set forth below. 

Example Algorithm 4 - An Optimal Algorithm 
[0086] The following is an example of the optimal-approaching 
embodiment using three (3) transmissions maximum. 

S2*(l) = arg min i { a^^O + 2 p(i) a^\2 | p(l), i2i))P(p(l)) + 2 p(,) S p(2) a^\3 \ 
p(l), p(2), Qi, Q\2\ p(l), Qi)))P(p(l), p(2)) } 

(42) 
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s/t I. p(i) Z p(2)2 p(3)P(Qi P(l)+ ii\2 I p(l), fli) p(2) + Q*(3 I p(l), p(2), Qi, S2*(2 | 
p(l), QO) p(3) <= p I p(l), p(2), p(3))P(p(l), p(2), p(3)) <= e 

(43) 

where Q*(2 \ p(l), QO = arg minj{ a^) + 2 p(2) a^*(3 | p(l), p(2), Qj)))P(p(2))} 

(44) 

s/t Z p(2) 2 p(3) P(^2i p(l)+ S2j p(2)+ Q\3 I p(l), p(2), Qi, Q^)p(3) <= p 1 p(l), p(2), 
p(3))P(p(2), p(3)) <= e 

(45) 

where £2*(3 | p(l), p(2), Qi, Q*(2( p(l), a,)) = arg min k OQid (46) 
s/t 2 p(3) P(a p(l)+ Qj p(2) + P(3) <= p I p(l), p(2), p(3)) P(p(3)) <= £ 

(47) 

[0087] For the second and third transmissions, the above equations 
(44) and (46) for Q*(2 | p(l). and a\3 \ p(l). p(2), Q*(l), S2*(2| 

p(l), respectively based on the actually realized histories of the 

previous transmissions and based on the latest channel feedback 
based predictions, are the same maimer as the two transmission 
example for the optimal algorithm. In using the latest channel 
feedback, the stationary assimiption of the pdf expressed in (5) may 
be invoked. 

[0088] In an embodiment of the invention, a near-optimal- 
approaching choice of resoixrce allocation, includes, at any said 
transmission of the packet, deteriiiiriing a channel condition vector of 
the current transmission of the packet and future trsmsmission of the 
packet, having associated probability retrieved from a joint probability 
distribution function of channel conditions and determining a 
candidate vector value of resource allocation of the current 
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transmission of the packet and candidate vector value of resource 
allocations of the future stages up to Mmax. 

[0089] In another embodiment of the invention, the resource 
allocation at any given stage is obtained by fixing a channel condition 
vector for the impending and futiire transmissions with associated 
probability taken firom the joint probability distribution function of 
channel conditions, fixing any candidate vector value of resource 
allocations of the current and future stages up to Mmax, said fixing any 
candidate vector value of resource allocations further including 
finding the conditional residual error probability for the choice of 
current resource allocation and candidate future resource allocation 
conditioned on the fixed channel conditions by looking up a reference 
error curve, checking if the conditional error probability is greater 
than the target residual error probability and if so, then the candidate 
future resource allocation is rejected as infeasible. If feasible, 
summing the costs of the candidate resource allocation components to 
obtain a total cost of the feasible candidate future resource allocation, 
among all feasible candidate future resource allocations for the 
channel condition vector, choosing the one with the least cost, if no 
candidates are feasible, choosing the largest resource allocation, 
selecting the choice of current resource allocation of the least cost 
future resource allocations and the associated channel condition 
probability, varying the channel condition vector and an associated 
probability and repeat the finding, checking, summing, choosing, and 
selecting steps and accumulating the channel condition probability 
separately in all cases where the choice of current resource allocation 
coincide, and assigning the choice of the current resource allocations 
that is most likely to cause a least cost resource allocation overall as 
the current resource allocation with the largest accumulated channel 
condition probability. 

[0090] In another embodiment of the invention, the reference error 
curve is e^q^ressed as 0 If a desired signal-to-noise, due to the 
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combination of fixed resource allocation and past, present and future 
channel conditions, is exceeded and 1, if not. 

[00911 One of ordinary skill will realize these examples of optimal- 
approaching embodiments can be easily extended and generalized to 
the case of Mmax transmissions. A solution always exists to the 
problem for Mmax unbounded. Even if a user is in the worst c'(i) 
situation at all times, repeated allocation of the maximum () resource 
over large enough number of transmissions will guarantee success 
(bin fillin g) with less than the residual error rate desired for the 
chosen block size. However, such a solution may not exist if Mmax is 
restricted to be finite, and this situation may become apparent at 
some later decision stage than the first. In that situation, the cost 
minimization criterion is not relevant as the maximvim resource needs 
to be allocated from that point onwards. The residual PER constraint 
trsinsforms into one of reducing or rninimizmg the "inequality" 
difference i.e. to minimize the amount of constraint violation, or in 
other words, to achieve as low a PER as possible. 

[0092] It is noted that in exemplary embodiments of the invention, 
the packet size B(l) is considered an allocated resource for the first 
transmission only and is kept fixed thereafter, since this is the reality 
of stop and wait HARQ operation. It should be mentioned that the 
discrete formulation (bin packing) of the optimization problem and its 
solution takes into account the rate set granularity and all other 
quantized variables observed by the base station resource mamager. 
Thus, even if there was no randomness associated with the channel, 
the solution of the bin packing formulation provides gains simply 
because it chooses a set of transmission format from the discrete 
space of those permissible tailored to reduce or minimize resource 
consumption which may otherwise be excessive (due to coarse 
granularity related round offs). 
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Cost Function and Optimal (MET) MCS Selection algorithm 

[0093] In many of the exemplary embodiments described above, the 
notion of allocated resources at the i^ instant into a variable was 
abstracted and a cost cXQ(i)) associated to the allocation. In other 
exemplary embodiments of the invention, in EVDV, the base station 
decides the triple (B(l), W(l), T(l)). Likewise in HSDPA, the base 
station 120 decides the triple (B(l), W(l). m(l)) for the first 
transmission that is then signeiled to the access terminal 130 which 
then interprets this information also as a unique MCS Thus £2(1) 
corresponds to these very triples for the two systems respectively. For 
k > 1, B(k) = B(l) in both systems and is no longer an independent 
variable. Thus n(k), k > 1 corresponds to the pair (W(k), T(k)) in EVDV 
and the pair (W(k), m(k)) in HSDPA, which is also interpreted as a 
unique MCS ^(k). 

[0094] The true resources allocated to the user from a shared pool 
are the Walsh codes WO, the proportional fraction of power Ec/lorO and 
the time interval TQ. Thus, in an exemplary embodiment of the 
invention, a cost function for a given transmission may be expressed 
as the product of the code allocation and time allocation i.e. W(k)T(k). 
This cost is additive over a sequence of re-transmissions. 
Furthermore, it makes sense to normalize this cost over the benefit 
gained which is the number of bits transmitted i.e. the packet size 
B(k) = B(l). Hence we write: 

afl(k)) = W(k)T(k)/B(k) = W(k)T(k)/B(l). (48) 
In the case of HSDPA, since TTI is constant, 

OtQ(k)) = W(k)T/B(k) = W(k)T/B(l). (49) 

[0095] Note, the power resource allocation is captured implicitly in 
the Walsh code allocation as they are assumed directly proportional to 
each other. When summed over k, the cost takes the form: 
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Ska^iCk)) = (W(l)T(l) + W(2)T(2) + . . . W(M^x)T(M„^x))/B(l) (50) 

[0096] Cost is to be reduced or minimized and can be interpreted as 
reducing or minimizing the energy spent per successfully transmitted 
bit . Equivalently, an exemplary embodiment of the invention for 
increasing or maximiziag the reciprocal may be expressed as : 

B(l)/ (W(l)T(l) + W(2)T(2) + . . . W(Mn^)T(Mn,ax)) (5 1) 

[0097] An embodiment further normalized with respect to the 
available Walsh space Wavaii may be represented as: 

B(l)/ {(W(l)/ Wavail)T(l) + (W(2) AVavail )T(2) + ... (W(M„^) AVavail )T(M^x)} 

(52) 

[0098] In the denominator of (52), the time allocated to each 
transmission is amortized by the fraction of multi-code bandwidth 
allocated and summed. This can be thought of as the "effective" or 
equivalent transmission time of the packet of size B(i) undergoing 
HARQ transmissions. Thus, the expression is essentially an "effective" 
HARQ rate or throughput of the user. Hence, in an exemplary 
embodiment of the invention, reducing or minimizing the resource 
cost metric discussed is advantageous to increase or maximize the 
HARQ rate of the user. Either interpretation suggests that the multi- 
user system throughput is also increased or maximized. The 
optimization problem becomes one of maximizing expected throughput 
(MET) when the direct sum of the code time resources has replaced an 
expected value of the sum i.e. a probabilistically weighted sum of the 
code time resources. As explained, an expectation or averaging 
becomes necessary because the "future" allocations of resource, teiken 
into account during resource allocation at any stage, are random 
variables. Thus the metric to be maximized becomes: 
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(1) = B(l)/ E{ 2i=i W(i)T(i)/ Wavaii} (53) 

[0099] So B*(l), W*(l), T*(l) are chosen to yield maximum R^^^ *(1) 
for the first transmission. It is noted that B(l) is adso a parameter to 
be selected for the user in optimizing throughput subject to B(l) <= 
backlog in the user's buffers. For 1 < k <= Mmax re-transmission the 
metric to be maximized may be expressed as; 

(k) = B(l)/ E{ Li^k W(i)T(i)/ Wavaii I c(k), c'(l), c'(k.l), 111(1),..., fi(k-l)} 

(54) 

[00100] So W*(k). T*(k) are chosen to yield maximimi R^^^rq (k) for the 
for the transmission with B(l) = B*(l) already having been decided 
prior to the first transmission . 

[00101] In the online embodiment of the maximum expected 
throughput (MEXT) criterion of the MCS selection problem at a given 
instant of scheduling, stage by stage decisions of |ii*() up to the 
maximum permissible number of re-trfinsmissions that optimizes an 
expected throughput metric . is formulated as a function of 

BO,W{ ) and T( ) subject to certain QoS constraints (e.g. residual PER). 
Ties broken in favor of energy excess i.e. lower residual PER. 

^*(1) = (W*(1),T*(1), B*(l)) 

= arg mini { (WiTi + E[ 2^.2 W*(k|/(k-l))T*(k| /(k-1))/ Wavaii | c(l)])/Bi} (55) 

where I (k-1) is shortheind notation for information about resources 
allocated in this step for stages earlier than k as a function of (Wu Ti) 
and the channel conditions assumed fixed up to time k-1. 
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s/t P({2i,iEc/Ior(i)c'(i) T(i) Ym(jx(i), Ec/Ior(i)c'(i))}Yr(H(l), ^(2),.. Ji(M)3(l)) Yd /B(l) 
<= (Eb/Nof /Rchip ) <= e 

(56) 

s/t Bi <= backlog (or instead of picking an optimal B*(l). an exemplary 
embodiment may round the backlog to the nearest Bo permitted and 
set this equal to B(l)) (57) 

where (W*(k|/(k-l)), T*(k| /(k-1))) is defined (In a recursive fashion) as a 
function of (Wi, Ti) £tnd channel conditions up to time k- 1 being fixed. 

[i*(k) = (W*(k), T*(k)) for k > 1 (58) 



= arg min i { (WiTi + E[ Ej=k+i W*aUa~l))T*(jl ^O-l))/ Wavaiil c(k), c'(l), c'(k-l), 
^i(l),...,H(k-l)])/B*(l)} (59) 

where /(j-1) is shorthand notation for information about resources 
allocated previously or in this step for stages earlier thein j as a 
function of (Wi, TO and the channel conditions already known or 
assumed up to time j- 1 , 

S/t P({Si=i Ee/Ior(i)c'(i) T(i) Ym(^(i), EeAor(i)c'(i))}Yr(Kt(l), fx(2),..., KM),B(1)) Yd/B(l) 
<= (Eb/No)* /Rchip ) <= £ (60) 

where (W*(j|/(j-l)), T*(j| ^(j-1))) is defined (ui a recursive fashion) as a 
fimction of (Wi, TJ and channel conditions up to time j-1 being fixed. 
[00102] An exemplary embodiment of the present invention may 
invoke Bellman's Djniamic Programming principle of optimality for 
multi-stage decision problems. If the policy adopted at every stage is 
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optimal given the past decisions, then the policy is overall optimal. 
The proof method is hence inductive. 

Overheads 

[00103] One skilled in the art will appreciate that the overheads 
(power and codes) consumed by each user's transmission or re- 
transmission are accounted for in the resource cost. Without proper 
accounting for the overhead costs, the solution may favor distributing 
resources across too many re-transmissions potentially proving to be 
too costly and sub-optimal in actual operation with overheads. In an 
embodiment of the invention, the cost includes a cost of associated 
control ch£innel overheads. 

[00104] An exemplary embodiment of the invention also accounts for 
the control channel code usage by adding the Walsh code fraction it 
consumes into the W(i) term of the cost function. The overhead power 
value depends on the channel report based prediction c(i), T(i) cind a 
margin against Doppler induced predictor variance and is factored 
into the aggregate Es/Nt equation as an additional per-transmission 
penalty. Thus, the overhead power penalty for the control channel is a 
fimction of the c(l), T(i) and P(c'(i) |c(i)). However, since at the stage, 
c(i) is not considered other than c(k) as a variable, a simpler approach 
may be used rather than add complexity and dimensions by adding 
new random variables in the form of the channel reports c(i) into the 
SBP formulation. The control channel power overhead over all c(i)s 
weighted by the pdf P(c(i) | c'(i)) are averaged. That is, compute Ec/Ior 
c(T(i), c(i), P(c'(i) I c(i))) based on usual methods of looking up 
conditional long-term error curves for the desired control channel 
FER. Then computing, 
[00105] 

Ec/Ior '(T(i), c'(i)= a) = 2 Ec/Ior '(T(i), c(i) = p, P(c'(i)|c(i)=p)) P(c(i) = p|c'(i)=a) 

(61) 

where P(c(i)= P|c'(i)=oc) = P(c'(i)=a|c(i)=P)P(c(i)=p)/P(c'(i)=a) (62) 
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m 

which can be simplified by invoking statlonarity assumption as: 
P(c(i)= p|c'(i)=a) = P(c'(l)=a|c(l)=P)P(c(l)=P)/P(c'(i)=a) (63) 

[00106] The penalty due to the control channel appears as a 
reduction in the power fraction Ec/Ior(i) allocated to the packet data 
channel in the i^ transmission by an amount equal to the 
approximate power fraction demanded by the control channel Ec/Ior ^ 
(T(i), c'(i)) for its reliable decoding. So, in the general case of IR with 
overheads, by plugging in (8) into (6), the bin-packing inequality for 
success may be expressed as: 



(Ec/Io.avail(i) - EcAo/(T(i),C*(i)) ) W(i) c'(i) T(i) Ym(^i(i), (Ec/Ior avail(i) - 

EcAo/(T(i),c'(i))) (W(i)AVavail), c'(i))}Yr(|i(l), ^(2),..., ^(M),B(1)) Yd /B(l) >= 

(Eb/Nof Wavail/Rchip (64) 

[00107] Thus, from the toted Ec/Ior (i) sillocated to the user in 
proportion to the allocated nimiber of channelization codes, only a 
portion (after accounting for overheads) is useful for packet data 
transmission. 

[00108] At higher Doppler and lower c(i), the overhead power spent 
for control channel signaling can be quite high especially if the TTI is 
small. This is because the control channel among other things carries 
information about the user ID and has to be signaled with high 
reliability to keep the packet data channel throughput high, for each 
individual re-transmission. Thus, it does not have the benefit of 
extensive time diversity that the packet data channel enjoys due to 
Hybrid ARQ. Thus, the fade margin set aside for high Doppler users to 
meet long term control channel performance targets can be liigh. 
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[00109] The high overhead cost for high Doppler users would suggest 
favoring a strategy of allocating resources to the packet data to 
succeed with fewer re-transmissions. Note the main strategy of 
stretching the resources spent for the packet data channel over a long 
period of time via several re-transmissions such that the resource 
allocated for each re-transmission and overall is small due to time 
diversity advantage more important in one embodiment favoring a 
strategy of aiUocating resources to the packet data to succeed with 
fewer re-transmissions. In fact, if the overheads could be neglected for 
high Doppler users, the optimum resource allocation would follow a 
pattern of minimum resource allocated per re-transmission and to 
extend to as many transmissions as needed. Thus two opposing 
requirements for control and traffic collide and cause the optimum 
nxmiber of transmissions and resource allocated per transmission to 
be centered. 

Folding Rate Selection 
[00110] In the metric to be optimized, the code time product cost 
expected in the future by the number of codes available in each TTI 
and sum may be normalized. This yields an effective transmission 
time. The block size divided by this time 3aelds the effective HARQ rate 
^arq Q ^arq q bascd ou Only futurc transmission times and formats 
expected while including the accumulated energy from previous 
transmission instances. This HARQ effective rate can be directly used 
as the "channel rate" in the priority function computation of the 
scheduling algorithm. For example, 

Modified C-RMAX: Deadline = J W S/R(t) + a where R(t) = i^^^ (t) (65) 

where the parameters used are J for user's Job size, W for weighting 
function, S for stretch, a for arrival time etc. 

Modified Proportional Fair: Priority = R(t)/T where R = !^^\t) (66) 
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where x is a windowed user throughput measure. 

[00111] In an exemplary embodiment of the present invention, for 
first transmission comparisons, higher Doppler users who incur 
higher effective transmission times due to their dependence on time 
diversity lengthening will be accorded lower priority over low Doppler 
users with low mean prediction error, given sufficient granularity for 
the latter. However, as the treinsmissions proceed and energy 
accumulates for the high Doppler user, the user's priority comparison 
against low Doppler users becomes more and more favorable as 
energy accumulates making the user less susceptible to fading 
variations, and only future transmission time is counted to calculate 
the user's rate. Thus with time diversity gain accruing high Doppler 
users get scheduling preferential treatment on par with low Doppler 
users. The comparison of high versus low Doppler is based on the 
same channel condition feedback report. 

[00112] Higher Doppler users will likely enter into re- transmissions 
automatically due to the nature of the rate selection algorithms and 
get capacity gains due to Hybrid ARQ. This last gain is measvired by 
comparing the more "aggressive" instantaneous rate selection (or 
resource allocation) of Statistical Bin Packing algorithms and the net 
rate achieved after many such re-transmissions for a single packet to 
success against the conservative rate (or over allocation of resources) 
yielded by constxlting the long-term curve if only one transmission 
were permitted as in pure rate adaptation with the ssime residual PER 
constraint. 

[00113] In the Statistical Bin Packing optimal algorithm, the very last 
re-transmission permitted must have enough resources allocated to 
ensure the residual error rate. Hence, for the last permitted 
transmission alone, the algorithm effectively consults a conditional 
long-term error curve to satisfy the target error rate. The conditioning 
arises due to a given amount of pre- accumulated energy and the latest 
channel report that are known to the base station by monitoring the 
mobile's channel quality feedback. If there is reasonable amount of 
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energy accumulated (which is an additive term) already due to 
sufficient number of re- transmissions, then the final transmission 
channel fade will have less effect (since this is a multiplicative term on 
the next transmission only) and there is an achieved SNR floor. Hence 
the conditional long-term curve approaches a short-term curve, which 
has better performance - thus the time diversity due to multiple 
transmissions comes into effect. 

[001141 The above metric of "effective HARQ rate" i^^"^ also captures 
the "frame fill" or "packing" efficiency issues arising out of small sized 
packets or small backlogs, since the transmission time is amortized by 
the fraction of codes actusdly allocated at each stage. Thus 
accounting for partiail resource usage due to small packets. 

Available Walsh Space 
[00115] An implicit assumption in the optimality of an algorithm is 
the total available Wsdsh codes and Ec/Ior do not change over the time 
of packet transmission. Such an assumption for Walsh codes is 
reasonable considering the long time constants of its change 
compared to the typical packet treinsmission times. Variations due to 
power control in actual Ec/Ior available for the i^ transmission with 
respect to the average available over a packet transmission can be 
grouped together with the statistics of the variations In the 
corresponding c*(i)s and hence incorporated within the P(c'(i) | c(l)). 
[00116] Furthermore, it may be assimaed that a user has the ability 
to utilize all the available Walsh codes and Ec/Ior whenever he/ she is 
scheduled for each re-transmission. Thus the user is assumed as a 
first priority user in the "greedy fill" scheduling process (considered a 
good sub-optimal heuristic of low complexity). This is a fair 
assumption to make in making a comparison of all users while 
making scheduling decisions based on HARQ effective rates. Also it is 
a fair approximation if the first user typically occupies most of the 
available bandwidth in each TTI (i.e. he has significant though not 
necessarily infinite backlog in his/her buffers). 
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[00117] However, In rate selection (post scheduling), this assumption 
can be modified if so desired in the following manner. The first priority 
user's resource allocation is subject to the constraint that W*(i | ) <= 
Wavaii for all i, i.e. for i = current instant k and all future instants. 
However for the p^ priority user, the user's resource allocation is 

subject to the constrsdnt that: 

p-i 

Wp*(k| ) <= Wavaii - 2u=l Wu*(k| ) (67) 

i.e. remaining Walsh codes after greedy filling p-1 other users at the 
current instant of scheduling k becomes the new Walsh code limit and 

Wp*(i| ) <= Wavaii for all i > k. (68) 
i.e. such a greedy fill induced limit is ignored while considering 
scheduling of the p^ user in futirre time instants. A similar set of 
constraints appUes for the Ec/Ior allocation to greedy filled users. 
[00118] Such simplifications can be corrected if so desired, by 
including the available Walsh codes for each user as another random 
variable (and hence the varying priorities of that user etre accounted 
for) with known statistics and averaging the metric R^^q over the 
sample space. Of course, the MCS selection at each TTI is constrained 
by the available Walsh codes. The proportional variation in user Ec/W 
available may be subsimied into the c'(i) statistics. 

[00119] Also, the Ec/Ior allocation can be made into a separate 
quantized dimension, by uncoupling it from the constraint of having it 
to be proportional to the number of Walsh codes allocated. 
[00120] Figure 4 illustrates an exemplary flow chart of the operation 
of Statistical Bin Packing that involves both ofOine computed look up 
tables and online computations in real time. Regarding obtaining the 
joint pdf P( ) of a user in practice, at the beginning of a user's packet 
call when there is insufficient history about the user, one could stairt 
with a PsimO obtained from simulations categorizing the user by 
Doppler, average geometry and cell loading (i.e. number of voice users. 
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niimber of data users). Then as histoiy for the user PhistC ) builds up, 
one could apply a straight line Interpolation with appropriately chosen 
decay parameter X(i) and obtain: 

P( I ) = PsiU I ) + (l-X(t)) Phist( I ) (69) 



where A,(t) = Ae 



wt 



(70) 



[001211 As shown in block 402 of Figure 4. inputs c(O-) . c(l). 

c(k), c'(l) c'(k-l), Doppler, ft(l) (k-1), are provided, which are 

based on past realized and current estimated values of the channel 
and past decisions of the algorithm. For each possible future channel 

vector c'(k) c*(M) and allocated resource combination, the residual 

FER, cost and other associated parameters are computed at block 404 
and provided to one of the exemplary statistical algorithms described 
above at block 408. In block 406, empirically obtained conditional 
PDF P(c'(k), ... c'(M) I input. Doppler, loading, and scheduler 
information is provided to the exemplary statistical algorithm at block 
408 and In block 410, one or more criterion are provided to exemplary 
statistical algorithm at block 408. The exemplary statistical algorithm 
produces the modulation and coding scheme (MCS) fi *(k) for the 
transmission of interest at block 412, 

[00122] For each possible sample waveform evolution of the channel 
across re-transmissions, and for each possible resource allocation 
combination, there can be evaluated offline the accumulated energy 
after due penalties and hence the probability (indicator function) of 
success/failure along with the resource cost and store it in Table 1 
below that forms an exemplary data structure: 



W(1),T(1), 
W(2),T(2), 

W(M),T(M) 



(GwGt)^ th 
column 
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c'(l),c'(2),...c'(M) 


(S W(i)T(i), 

P(ZEs/Nt . 
penalties 

<=Eb/No*)) 






















Search 
subspace of 
specific row 
index 


and specific 
range of 
columns 




(Gc)^ th row 











Table 1 



[00123] Each cell in the above "key" matrix data structure is row 
indexed by an ensemble of specific channel conditions for M 
transmissions and column indexed by a particular resource allocation 
(MCS) combination for M transmissions. (Packet data size may also be 
included in the column index combination as also the modulation 
order m that substitutes for T(i) in the case of HSDPA). Thus, each cell 
corresponds to a specific combination of MCS transmitted over 
specific channel conditions. The total resource cost (along with the 
columnar components) and the Indicator function of success or 
failure, i.e. P(EEs/Nt . penalties<= Eb/No*) by checking inequality (6) are pre- 
computed and stored in the cell. This data stmcture is common to aU 
users that fall in the same Doppler category. 

[00124] In an embodiment of the invention, resource allocation 
determinations are made using the pre-computed matrix data 
stmcture of Table 1, containing the total resource cost and an 
indicator fxmction of success or failure or a conditional probability of 
failure, which is row indexed by an ensemble of channel conditions 
and column indexed by the resource allocation (MCS) combinations. 
In another embodiment of the invention, the pre-computed matrix 
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data structure may be defined, maintained and looked up for each 
type of chamiel possibly associated with a user, that includes doppler 
or rate of variation of channel conditions. 

[00125] In another embodiment of the invention, the pre-computed 
matrix data structure may be defined, maintained and looked up for 
each packet size possible in the communication system. In another 
embodiment of the invention, the pre-computed matrix data structure 
may be re-used for subsequent transmissions by using a subset of the 
data structure corresponding to the channel conditions and resource 
allocations of past transmissions. 

[00126] Gw is the chosen granxilarlty of the Walsh space allocation; 
for example if the available Walsh space is allocated as all of the codes 
or half of it then Gw = 2 levels. Gt the granularity of the TTI = 1 for 
HSDPA and 3 for EVDV. Gc is the granularity of the C/I report, which 
is taken as the granvilarity levels of the achieved C/I as well and 
equals 16 for EVDV and 32 for HSDPA. Of course, Gc can be made 
smaller If coarser granularity is desired for lower computational 
complexity. 

[00127] In other embodiments, it may be worth the tradeoff of 
reduced complexity for coarser granularity if significant proportion of 
the gains of bin packing can be obtained with coarse granularity. 
Furthermore, system performance may be better off with coarser 
granularity choices and good algorithms that effectively use them than 
with superfine granularity of rates, but lacking efficient algorithms to 
exploit them (with poorly formulated heuristics typically being the 
alternative). For the same reason (and also due to diminishing returns 
with increased diversity) the number of re-transmissions Mmax that 
figures in the exponent in the formula for the size of the key data 
structure could be restricted to a small number like two or three. 
[00128] As the re-transmissions proceed, the same data structure 
may be re-used for subsequent lookups, yet over a smaller dimension 
that is exponential reducing in the order of the remaining re- 
transmissions. The rows and columns of interest are obtained by 
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fixing the channel conditions and resource allocation of the already 
occurred past transmissions. Based on this structure, an exemplary 
embodiment of the invention may implement the optimal and/or sub- 
optimal (approx MET . ML) variations of Statistical Bin Packing 
Algorithm. 

[00129] In an embodiment for an approximate MET, the method for 
the case of maximum of two transmissions was set forth in (28) 
through (33). For the first transmission, an arbitraiy resource 
combination x i.e. (Wx, Tx) is chosen as a candidate. This fixes a group 
of colunms. Now for any ensemble of channel conditions, i.e. a fixed 
row index, we search over the group of columns to find a cell i.e. 
allocate resource combinations for the remaining transmissions such 
that a) the transmission has error rate better than the requirement 
and b) has minimum resource cost. If no cell meets a) then that cell 
having maxtmimi resource allocation and hence the lowest error rate 
is chosen. This process of finding the cell or allocation with minimum 
resoiorce cost is done for each row (i.e. channel condition) and the 
average resource cost and average error probability is computed using 
the probability of channel ensemble associated with each row. The 
whole procedure is repeated with a different resource combination y 
i.e. (Wy, Ty) as the candidate. That resource combination (W*, T*) 
whose average error probability is below the target and whose 
resource cost is minimimi is assigned for the first transmission. Ties 
in resource cost are broken in favor of resource combination with 
lower error rate. For subsequent transmissions k >1, the same 
principle is applied reciu-sively. Resources allocated to previous 
transmissions 1 through k-1 and their associated channel conditions 
that are now known to the base station essentially define and pre-flx a 
subgroup of rows and columns in the key data table. The best 
resource (in the approx MET sense) to allocate for the kth 
transmission by carrying out a similar search may be found, except 
that it is restricted within the subspace of intersecting rows and 
colimms pre-fixed by earlier transmissions. 
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[00130] Tlie ML rate algorithm for the case of maxiinuni of two 
transmissions was elaborated in (34) through (41), For the first 
transmission, fix any ensemble of channel conditions, i.e. a fixed row 
index. Search over the cells in this row to find the best cell i.e. 
resource combinations for the first and remaining transmissions such 
that a) the transmission has error rate better than the requirement 
and b) has minimum resource cost. If no cell meets a) then that cell 
having maximum resource allocation, and hence the lowest error rate, 
is chosen. The first transmission component of the resource allocation 
combination is logged as a row winner along with the channel 
ensemble probability associated with the row. Additional row winner 
parameters like the resource cost and the error rate achieved by the 
winning combination may also be logged. The same procedure is 
repeated for every row (channel condition ensemble) in the table and 
the row winners and associated row probabilities are logged. Finally, 
every resource allocation combination that can be used as the first 
transmission component is ranked according to the aggregate 
likelihood (by accumulating the row probability when it is a winning 
combination) of being a row winner. That resource combination with 
top rank i.e. maximum likelihood is allocated for the first 
transmission. Ties in top rank are broken in favor of resource 
combination with lower average error rate or resource cost. For 
subsequent transmissions k >1, the same principle is applied 
recursively. Resources allocated to previous transmissions 1 through 
k-1 and their associated channel conditions that are now known to 
the base station essentially define and pre-fix a subgroup of rows aind 
columns in the key data table. The best resource (in the ML sense) to 
allocate for the kth transmission may be fovmd by carrying out a 
similar search except that it is restricted within the subspace of 
intersecting rows and columns pre-fixed by earlier transmissions. 
(00131] In exemplary embodiment described above, the residual 
packet error rate PER < e constraint has been a recurring theme. 
However, this can be dropped altogether and instead the throughput 
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maximization Rharq metric can be modified by multiplying the bits 
transmitted over the by (1 -residual PER) achieved for a given resource 
allocation combination. 

(1) = E{[B(1)(1 - PER(W(1), T(l), .. ., W(M™,x), T(M^x)))]/ W(i)T(i)/ Wavaii]} (71) 

[00132] The step function approximation of the turbo code AWGN 
waterfall curve for reference MCS is not a necessity for the algorithm. 
The exact curve P itself can be used - so instead of checking the 
condition whether an aggregate Eb/No (weighted by penalties) exceeds 
a reference Eb/No. whose probability is then an indicator function 
(with result 0 or 1), the actual probabilily for that aggregate Eb/No 
may be looked up and used to check whether the failure probability is 
less than □ in all relevant formulas. 

[00133] Further as described above, the chase combining (CC) 
scenario described in (7) may not fit in the SBP formulation of (10). 
However, CC can be treated by an SBP formulation of slightly different 
form; silthough the analysis and resource allocation algorithms that 
follow use the ssime principles as for IR except that the dimensionality 
of resource allocation is considerably reduced. This is because the 
packet size, time and Walsh dimensions (that fixes the MCS) fixed for 
the first transmission remains fixed throughout all re- transmissions. 
So the resource allocation problem just reduces to one of figuring out 
the optimal format for the first transmission that minimizes resources 
across all transmissions taking into account control overheads etc. in 
a similar manner. 

[00134] As described above, the fact that only two CDM-ed users are 
allowed at a time in IxEVDV (with the additional constraint of the 
equality of their TTI T(i)) can be exploited in a special way by departing 
from the "greedy fill" approach, which was invoked for its simplicity in 
dealing with several CDM-ed users. The cost function could then 
depart from being based on individual user's code time resource 
product (or meiximum expected throughput) basis to a joint sum 
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throughput metric of two users. In doing so, key fsdmess conditions 
need to be preserved, such as the first priority user always transmits 
the largest number of bits possible in case of a new transmission. 
[00135] As described above in conjunction with the various 
exemplary embodiment of the present invention, the problem of 
deciding the "right" rate or MCS or aggression factor of a scheduled 
user in a packet data system using Adaptive and Asynchronous IR 
was discussed in detail and formulated as a statistical bin packing 
problem and solved for the criterion that maximized expected 
throughput. Exemplary embodiments of the present invention also 
describe sub-optimal methods and algorithms for other criteria were 
also described together with fast implementation strategies. 
Exemplary embodiments of the present invention using the rate or 
throughput determined by statistical bin packing algorithm in the 
scheduler priority function were also described. 

100136] As described above in conjunction with the various 
exemplary embodiments of the present invention, BTS/BSC are terms 
generally associated with North American systems, but the present 
invention may be used with any type of base station (such as the 
European "Node B") and any type of controller (such as the European 
"RNC"). 

[00137] The invention may be embodied in other forms without 
departing from its spirit and essential characteristics. The described 
embodiments axe to be considered only non-limiting examples of the 
invention. The scope of the invention is to be measured by the 
appended claims. All changes which come within the meaning and 
equivalency of the claims are to be embraced within their scope. 
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